Searching of Gapped Repeats and Subrepetitions in a Word
نویسندگان
چکیده
A gapped repeat is a factor of the form uvu where u and v are nonempty words. The period of the gapped repeat is defined as |u|+ |v|. The gapped repeat is maximal if it cannot be extended to the left or to the right by at least one letter with preserving its period. The gapped repeat is called α-gapped if its period is not greater than α|v|. A δsubrepetition is a factor which exponent is less than 2 but is not less than 1 + δ (the exponent of the factor is the quotient of the length and the minimal period of the factor). The δ-subrepetition is maximal if it cannot be extended to the left or to the right by at least one letter with preserving its minimal period. We reveal a close relation between maximal gapped repeats and maximal subrepetitions. Moreover, we show that in a word of length n the number of maximal α-gapped repeats is bounded by O(αn) and the number of maximal δ-subrepetitions is bounded by O(n/δ). Using the obtained upper bounds, we propose algorithms for finding all maximal α-gapped repeats and all maximal δ-subrepetitions in a word of length n. The algorithm for finding all maximal α-gapped repeats has O(αn) time complexity for the case of constant alphabet size and O(n log n + αn) time complexity for the general case. For finding
منابع مشابه
Optimal searching of gapped repeats in a word
Following (Kolpakov et al., 2013; Gawrychowski and Manea, 2015), we continue the study of α-gapped repeats in strings, defined as factors uvu with |uv| ≤ α|u|. Our main result is the O(αn) bound on the number of maximal α-gapped repeats in a string of length n, previously proved to be O(α2n) in (Kolpakov et al., 2013). For a closely related notion of maximal δ-subrepetition (maximal factors of ...
متن کاملEfficiently Finding All Maximal alpha-gapped Repeats
For α ≥ 1, an α-gapped repeat in a word w is a factor uvu of w such that |uv| ≤ α|u|; the two factors u in such a repeat are called arms, while the factor v is called gap. Such a repeat is called maximal if its arms cannot be extended simultaneously with the same symbol to the right or, respectively, to the left. In this paper we show that the number of maximal α-gapped repeats that may occur i...
متن کاملEfficiently Finding All Maximal $α$-gapped Repeats
For α ≥ 1, an α-gapped repeat in a word w is a factor uvu of w such that |uv| ≤ α|u|; the two occurrences of a factor u in such a repeat are called arms. Such a repeat is called maximal if its arms cannot be extended simultaneously with the same symbol to the right nor to the left. We show that the number of all maximal α-gapped repeats occurring in words of length n is upper bounded by 18αn, a...
متن کاملImproved Upper Bounds on all Maximal $\alpha$-gapped Repeats and Palindromes
We show that the number of all maximal α-gapped repeats and palindromes of a word of length n is at most 3(π/6 + 5/2)αn and 7(π/6 + 1/2)αn − 5n− 1, respectively.
متن کاملGapped Pattern Statistics
We give a probabilistic analysis of parameters related to α-gapped repeats and palindromes in random words, under both uniform and memoryless distributions (where letters have different probabilities, but are drawn independently). More precisely, we study the expected number of maximal α-gapped patterns, as well as the expected length of the longest α-gapped pattern in a random word. 1998 ACM S...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014